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ABSTRACT 



The purpose of this study Is to present and illustrate the 
application of a general llneaj? model for the analysis of teat netw(>rks 
^a^ed on Rasch m<?asurement *mddels. Test networks can be used to Vertically 
'^"^ eqqate a set of tests which coyer a wide range of difficulties/ /The cri- 

terla of coherence and consistency are proposed in order to assess the 

/ • • . / 

• - adequacy of the vertical equating within the test network* The/ method is 
illustrated using a set of standardized readihg tests which are a part of 
the Achievement Series of the CompreheKsive Assessment Prograin (Scott, 
Fbresman and Conq^any, 1980). - 



CONSTRUCTING A TEST NETWORK WITH A RASai MBASURBMBNT MODEL 

The equating of person meaaurem<>nt9 obtained on testa compoaed of 
different itema la one of the major problema encountered In educational 
meaauremei^t. This problem arises whenever we calibrate a set of it^ms 
that have a wide range of difficulties which go beyond a single indivi- 
dual's ability to provide meaningful responses. For exaiiqile, as edi|ca- 
tors we may be interested in tracing an individual's development In 
reading comprehension over thfe elementary and secondary school years. 
It would be extremely difficult to develop a set of test Items or a 
reading passage that would be appropriate for both first and twelfth 
graders. An item would either be so hard thJt everyone fails (except' 
by chancel or easy that everyone succeeds; both of these ctfses pro^de 
no useful inforaatlon that c^tn be used to calibrate the items, ^^v 

In order to deal with this problem, comprehensive achieV^ent test 
series designed to measure achievement over a wide age ^^^^ade range 
are typically composed of several levels and forms wK3h are designed to 
be appropriate for selected age or grade groups^. iThe separate levels and 
form.of the achievement series must then be ^Sated, so that the person 



ability estimates obtained from differe^^^ets of items are comparable 
and c^ be used to represent the lon^^Eion of the individual on one 



unidimensional trait that spans '^he age or grade levels over which we ^ 
wish to trace growth or chaijge. The goal in test equating is to step 
beyond the /specific items contained in the separate test levels and forms 




s 




ardor to^ibtnr^ infbrrnfition on ifi,tftr\t tr»it ftom ,^e.ch individual w« 

Jk . ^ ) ^ 

uro'intoroftod ii\ measuring, jr'^niiichiavwant te^t^aeri^^' 1« compo^iHl 

of itoma callbratad on ^a alngla liiiddimanaional latant trait scala/ t^han 

- - - ^ ' . ' Mi^* 

I it bocdmaa po^aibia to obtain aquivalant aftd comparable aatimatas ofi, ^ 

eaoSi indivljj|mVa Iqcatlon pn this latent trait regardless of form 

linnel. The atsence of thli equating problem, then fa to develop procedures 

! for detemrinlng and tesfting the comparability of the ability estimates 

j obtained sfrom several dl^^rent\tests coraposed of different itc?ms over 

pecifieied dif f icult ^ glt^ge . If the forms* are. designed to measure t^e^^ 

latent trait at similaT^ability levels, the procedure is generally called 

\ - horizontal, equating, e.g., alt-emate forms equatjmg. The equating of ^ 

measurement results obtained on tests of different levels of . difficulty is 

called vertical equating. The purpose of this .papeflHls to develop and 

illiistx)ate a solution to t^e problems enccj^tefed in the vertical etju^ting 

of dn achievement test sfries based on the simplest latent trait model, 

. thei^'Rasch model. ■ / 

background „ e^^^ 

/Various methods haye^been proposed as solutions to the problem of 
vertical equating. The problem was recognized as early as the 1920s 
when Thomdike pointed "out that, ^ . 



With the development of group ^ests and tests for 
use with higher levels of*^ intelligence, it is be- 
coming more and more necessary to /transmute a score 
ob^tained with one test into the score tha?^is 
equiyallett to it in some other te§t. ^ 
^ (thor^dike,'1922, p. 29) 



■ ■ ■ ■ • . . ■{ • . y . 

Thovndllct 'hnwunoMtfcl*' ««orti uilng hislpTQbabU tffTpr nioihod of .icnUng i 

. » ' , " ' . ' ' ' . ■ ' .■ 

Cnioxndiktt, 1932} TrfkM, 1916) tlnivi^ont In a toTita of «nt6l«8 in 
tht 1920* i (ll«iQVtl)(i4 hia iibfolufer tOiUilg noi:ho4 whibh hi pTotioiiod as « | 
solution to tli« pvobl«n of vartleAl «quitlng (Thwatono, .li)is; 1927, 192H).; 
(a«« PlanagAn and Schwours Xldsa] f ot an llluatratlon of tihurst'one ' a method 
of absolutu acaUng appiied to thrf aquat^ng of IntolUg^ncu to«t acqrea) ,' 
Another aolutlon f or thfj v«rtlcal >tiu^ti,ng of acoros la provided 'by th« 
oqulpexicentlle method of vertical equating. More Tocen'tlV, latent trait 
measurement theory has been recommended aa'a source Solution to trhe 
"lhtractable'»^problem of i«quatin^ (Lord, 1977; Marccj, > 19lf7; tifrlght, 1977; 
Rasch, 1960; Wright, 1967; Wtight and Stone, 1^7^) , A g^^ dpaX of^' ^ ^ 
tecent attention in the psychometric literature has bejbiy direct e^^TEoWarc^ 
assessing the adequacy, of the Rasch model fp^j^ydrtlca^^ and cam-, 

parison of' the various available methods /fot vertit:al o^uatf^g (Wright^, ' 
* jlwi BentZ; aitd Bashaw-, 1977, 1975? Slinde aAd Unnv 19787r|ai9; Kolen, 

•con<^*R^iQm oufifr tne adlMuaG^ theVRasqh model for' the vertical equating 
The conflicting cpncluslpns oveitf tlie adequacy of the Rasch mddel fjbr 



venicat'equa[t5§g -stem " from two majof^>s^irces. The first sourCe 6£ conflict 
involves^^he robustnfiss;^ of the lias^h model . If^e t^st items do not f it the 
Rasch model, then- a vertical equating based on thes^ items will lead ' " ^ 
to an unsatis^ctory equatinjjj^jj^e^ Ijey here is to^s$ Rasch test develop- ^ 
ment strategies in order to produce a set of these iteins which have the \ 
property of specific objectivity (Rasch, 1966a, 1966b), \lf specif ic^objecti|ity 
in Basch's sense is obtained, then the pers6n-f|?ee calibration of items / 
cin be' achieved. This characteristic is always, an hypothesis that must be 
specifically tested 4n each measuring'' situation. Slinde and tihn (1978, 1979) 



Mid Loyd mui Hopv«it (1980) mti9i mm^\i te«t Xum thut mvn not daisiigniii4 
to fit tl)# R#jich moduli • Anoth^ii* jioura«i t\\f^t my iccount fp%^ ihti i^^onfllct- 
itig conclunloin^ U th^ p^abUm of ^iNating Girtt^^^ii Iot d«t#rsijinlng th# 
ttdtquncy: of mch m#thod for th<ii vdrtioiil ^^ujiting ot a off tetktn. . 

Thdr« 1» no 9lngl« ohj active criteria for cowpitrlng th«> restulta of 
«qulp«i'c«htil« and latent trait squating. Th« tTadltlonal critorkin that 
"two acoraa^ one on form X and tho other on form Y (whtwe X and Y mea-, 

. 3ur0 the siimd funct ion with tha ^m^ dograo of rtiUablUty) , may b« 
considered equivalent if their correaponding percentile ranks in either 
group are equal" proposed by Angoff (1971, p. S63) or the criterion 

. proposed by Lord (1977, p. 128) that tests X and Y can be considered 

equated if and only If it is a matter of indifference to each 

^^^^arainee whether he is to take tdst X or test Yi" may unfairly bias |he 
conclusion about the adequacy of a method for vertical equating. 

In this study, the tests that are to be equated^ were developed 

" according to Rasch test development procedures. The iteips were pilotdd 
in a representative national sample and items were selected that fit the 

' Rasch model. In this case, a single linking constant based on common 
items should theoretically provide equivalent person ineasurl!^ments 
regardless of test level or form. i 
The issue of whal; criteria to^ use to assess the adequacy of the 
*verttcal equating is one th^^requires further attention. For the* 
purposes of this study," the Adequacy of the Rasch model is defined in 
terms of the consistency and coherence of the linking constants within 

: the test network. Wright (1977) in his discussion of of t;est networks 
conceived of these networks as being composed of building blocks based 



i 

U About StS/CNjjKjj ^^ijl^i^ ^ ^23*^33^ which thia N's* ar^ tha C4ilbra«. 
tion «am)|)l«i siUea anU K'a tha uuiiubdr olf It^msi la «>iich link (Wrlaht m(\ 
Stom, 197S>)- Thiji logUrcan uxtctulmi to^vqr othc»r trlangUa ulf tqata 
within th« t0!it network. The aucco^a q( tho tajif notworjc la then asstJas^il 
on tha basla of a .nori^s of co nslatenc )^ checkn pdrformmi on all poajilbla 
trlj^nglea. In coinplet© teat networks, I.e., jietji of common Item^i ure avIiU- 
able for all tests In the network, this solution and procedure seems workable. 
In constructing test networks with a Rasch measurement' model designed to ver-^ 
tlcally equate. tests over a wide range of difficulties, situations are encoua- 
tared where common Items are not available for linking every form and level, 
ThJLs leads' to Incomplete test networks in which no direct information is avail 
able^to estimate the needed linking constants, ^ ThiSs is not an InsurmountaMo 
problem and procedures have been developed to estimate links where no direct 
Information is available (Wright and Stone, 1979). The method proposed in 
this paper is not intended to supplant these methods, but it is intejided to 
provide a comprehensive approach with information about the overall coherence 
and consistency of a test network, while at the same time providing a method 
for estimating missing linking constants and providing a test of fit for' each 
of the observed linking constants in the test network, Thd method can be used 
with complete and incomplete test networks, 

The method proposed in this paper begins with. the matrix of linking con- 
stants which -have already been deveToped for the tests where common items are 



tini»r m^^l fair han4V(n)i fl^nlni aml|iiii\^ hy ilMX) m\A 

liflwyrily iii#4 with imirtd ao^pftion ditii crhiiritan«i^ %\\\^^ 
thii tMtirictta {>iroiliic«d tri |i«ilr«d csom]piiYtiion H»ip<»rtiamit;^ iiv^ silmtlur t|i torn 

iugf «ttii itii^lf At^ ^ u«ii|ftii ^pprodCh ta th«» «itiiiln«i€ Ion off th<t av«irtill 

pro«ch to tho «5ittw4tlon of mtjist^ig linking constant ^ in lncoinpl«t«i t«?st 
networks.' 

Tfii» purpo»« of t\\i% paper to illustrate the application of thU 
general linear moilel a$ a technique for examining the fit of a test net- 
>^work lAlch.can bit used aa an alt(pmative crltarlon fo)f aaseaalng whether or 
not the Rasch raoddl provldesi an adequate solution to thd problem of ver-. 
ileal equating. The aasuraptloji ^3 made that if the itema wit>iin each tent fit 
th'ii Rasch model (an assumption that is explicitly tested), then a single 
link^^ng constant provides suffjLclent informarMon for obtaining equivalent 
person ability estiraateis regardless of test or form. The problem then is 
to assess the coherence and consistency. of the network based on these linking 
constant s^^sing th^^general linear model proposed in this paper. If the ob- 
iserved 1 inking /(^onst ants fit the model, then the criterion of consistency is 
met ^d an adequate vertical equating has been accomplished. 

Method 

A General Li^nelr Model for Examininj^jMMtfe^ 




Let represent the linking constant for equating test3 i and j, This^ 
linking constant is a function of the differ^enc^ between the difficulties of 



ttti i,^ 4^, and itil ^.^ 4j» 'l(%4» bn i», 



whi»r«i X Id 4 coiuwii V0ctur uf tho (m(m l ubsiai v«a I ink lug tuusic uut a. 
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£ can b# «*tlwiit«d by mliitiniilint tha nrror t#rm, , In ttm uiunl i^n)^ u?iing 
UA«t iquArnn, By Introducing 4 diagonal mutrlx, i), of w«lghtji It isi juisiJiUiU 
to handU IncompUt© t«st ndtworkai. Thisi. Unit jimmr^ai solution l?i obt4ln<id 
by solving tb« followlnji dquAtlon*. 

91 "* - ^i )' I) (A ^ At\) (3) 
or ^ 

1 

whore D is a diagonal matrix with Is for the observed links and Os for the 
missing links. In the case of a complete test network, D is an identity 
matrix. The normal equations are giveit by 

and solving for the test difficulties, 6 , we have 



i* • * I* . « to) 

tug Clin b« ui#4« 

# A A . U ) 

A * m. " 

wh«r« £ differ* from A* by th« ddjolnlng of ii i«ro to ihm Ia^i iO¥ *if 1* 

to r«pr«««nt ihn t#»t Jlfflciilty of th«, tdat l» 03104 to anchor 
th# tftt ii#twtyrk ti i<»ro by tt^f livttlon. 

An 0|jli|^#i^«iit r<i»ldudl» €^ , cun %hmn bn ii#flii#d a^i 

and a standardised residual defined as 

Z - (c - £. )/ S . , " (9) 

where c. Is the mean vector of the resldtials and S Is the standard 

deviation of the residuals. If the data Ats the model, is approxiraately 

nontolly distributed with a mean of zero and a standard deviation of one. 
/ 

In order to test the fit of the data to the model, the standardized 



residuals caa^fae exasiined md any values ^eater than 2 standard errors 
(z > 2) should be examined in depth. These links are su^^pect and ^he 
researcher may wish to eliminate these links and 're-.95timate the predicted 
links. Another approach to the analysis of the standardi^ed^esiduals - 
isVto use a ranltit plot (Tukey, .1962) . Basically, this involves ordering 
the standardized residuals from th^ smallest to thb largest, where i is 
^an in^^ of residuals. The rankits, 

fti, are equal to the 'standard^ notmal deviates. .which 
following proportions .for each i,' 

. V . (3i^lD/(3s+l) ; (10) 

\ J ' . . ' ^ . - • ■ 

A rahkit plot can then constructed with the standardized residuals on the 
vertical axis and the rankits on the horizontal axis, If ' the model fit 
the data, then this plot should be a straight 45<> line. If the residual 
analysis indicates an acceptable fit, then an adequate vertical equating 
has been accon^lished. d 

The steps in examining a test netWbrk are as follows: 

... ' ' r - ■ . ' 

1. Construct an M x M matrix with observed linking . 

constants in the matrix. 

2. Cbnstruct a CM(M-I)] /2 x 1 column vector composed 

of all of the entries above the diagonal ^( which is zero) 
in cell subscscipt order. 

Obtain solutions to equations (6), (7), (8) and (9). 

4. Examine the standardized residuals and determine ^ 
how well the data fits the model. 

5. I£ the fit of the model is not acceptable, then elimi- r^x 
nate misfitting links, and repeat steps 3 and 4. 



^lOTTe'spond to th€t 



6; If an acceptable fit of the model is pbtained, 

then use the estimated linking constants ob- — 

tained through equation (7) for linking the tests. 
^ TMs vecter provided all the linking constants, 

and any test can be chosen at this point as an 9 
anchor test. ' ' ^ 

Description of Test Network Development and Sample 

In dfder to construct the test network analyzed in this paper, nine, 

V 

linking tests with 12 to 36 common items werg developed. Each one of 
these)linking tests contained items from at least two and as many as four 
forms from the Achievement Series of the Comprehensive Assessment Program 
(Scoitt, Foresman and Company, 1980), designed to measure reading achieve- 
ment from pre-kind'ergart en through high school. The^verall network is ^ 
shown in Figure 1. The ^iquares represent levels 4 through 14 in the ^ 
Achievemen t Series; the circles represent the nine linking tests speci- ^ - 

! 

fically createdL for this study; the connecting lines represent sets of ' 
common items / The appropriate levels of the nine linking tests (2,4,7,8,11,12, 
15,16,*^19) were administered to the elementary and secondary school students in 
Huron County, Ohio. The total number of students tested was 3,982. 

BICAL (Wright, Mead and Bell, 1979) was used to test the fit of the ' 
items within each of the 20 tests CH from the Achievement Series^ plus 
the 9 specially created linking tests). The items fit very well, and it 
was not necessary to eliminate any of the items at this stage. The next 
step was to obtain the average difficulty differences for common items in 
adjacent and non-adjacent test levels which will be used as thq linking 
constants. Plots were constructed for all of the linking items, and some 
of these items were eliminated Csee Engelhard 11980] for a descriptio|}, of 
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this procedure). In general, the elimination of connnon items did not 
have very much of an effect on the value of the linking constant. The 
observed linking constants obtained, for the test network are given in 
Table 1. Table 2 gives the number o^f items, . number of individuals, and 
the standard error for each link computed by equation (H)- - 

In drder to apply the linear model to this data, a computer program 
was written using the matrix procedures in SAS, The observed and 
missing links were listed as a 190 x 1 (20 x 19/2^= 190) column yector, 
with zeroes for all of the missing linking constants. An A matrix was 
constructed and a solution obtained following the earlier outlined pro- 
cedure. Table 2 gives the cell subscripts, observed links, number of - 
items, nimber of individuals and standard error for each link. The 
standard error was obtained by the following formula: 

, - SE CX) = 3.5/ Cnk)^ . - (11) 

Table Skives the observed and predicted links, along with an anal)^sis of 
the standardized residuals. Figure 2 gives the rankit plot of the stan- 
dardized residuals. 

Results . 
Table 3 gives the residual analysis for the test network. Figure 1, 
gives the rankit plot for this data. In general, the data seems to fit 
the model relatively well, adding support to the contention that an 
adequate vertical equating has been accomplished. For example, using 
Wright* s triangular analysis of a set of three tests, we see that using 
the observed links for linking tests 15, 16, and 19 the value of the sum 
is .139 (.483 + .482 - 1.104), while using the predicted links the value 
of the sum is - 0.001 (.523 + .540 - 1.064). By this criterion proposed 



by Wright, the estimated linking constants which are based on a considera- 
tion o£ all the data in the linking matrix are even more coherent. Two 
of the standardiz^ed residuals are greater than 2; these are the links for 
tests 4 and 5 (1.304) and tests 4 and 7 (.863). A re-examination of the 
linking plots for these' tests showed a considerable amount of spread in 
these values which should be a straight line, so that these link^ were 
not as well defined as the other links. Since by chance with 31 observed* 
links wfe would ^ect at the .05 level 1.5 (.05 x 31) links to be greater 



than 2, ±his result is nbt too unlikely, and the decision was %de to 
keep these links arid no re-estimation was calculated. 

Table 4 gives estimates of the test difficulties 5^^ centered on the 
last test (620* - '^'^y definition ). In order to illustrate, an alternate 
centering of the test network, column 3 in Table 4 gives the predicted 

V\ ^ ^ ■ - /' : 

linking constants when the test network is centered on test 6 (§6 - 0.0). 
These valx4es can be obtained in two ways. They can be obtained from the 
vector of predicted linking constants (X) o^ they can more simply be ob- 
tained by subtracting the values in S;, from the value of. fig. This* wil;l 
re-center the test network on test 6. Table 4 also gives the set of ini-' 
tial linking constants that were used in the preliminary calit)rat ion of 
'the reading tests in the Achievement Series. These initial values were 
obtained by averaging different possible linking paths and in some cases 
through simply taking the shortest path between two tests and summing 
the necessary observed linking constants. This earlier procedure did not 
take into account ^11 of the linking information that wais available and 
the results differ from the results of the study by an average of .6 of 
a Ipgit which is a significant difference, , 
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Discussion 

In this paper I have proposed, a metliod for assessing the consistency 

and coherence of a test network thant can be^'used^ vertically equate, a set 

> ' ■ ■ ■• • , 

of tests. This paper differs from previous research on vertical equating with 

' ^ ■ 9^ 

the Rasch model in several asjpects. The first difference is that the tests 
that are vertically equated withia^the te^t^ijetwbrk were developed using the 
Rasch model. Recrenf research by Slinde and L^nn^ (197S^197a) and Loyd and 
Hoover (1980) have ^een basically research the robustness of the'Rasch 
model. They have examined how. well the Rasch model fits item, or ^s't dat^ that 
is already available. Any equating and particularly any v.ertical equating that 
is based on misfitting itenjs will not be complel;ely adequate. The key in ^ny 
' type of equating based on' the'^'^arsqh model is to have items and tests that fit . 
the Rasch model, and therefore have the desiralble jiroperties that, are assoclatec 

. - ' - ' , ^' ; \ : - 

with speccific objectivity. .The fit of the items within each test, the fit of 
items in each link and finally the fit of the linking constant within the test- 
network must be^ examined. In tho^e cases whqre the data fits the model, a single 
linking, constant provides 'sufficient informaiion for the ^equating of person " 
measurements obtained form't^sts that vary in difficulty. 

■ ^ • ' ^ . ■ . • * ■ ' . •/ J ^ . ^ ■ V ■ » , 

The method presented in this papeif is simjLlar to. the 'one used by.. Rent z 

^ . Or ■ ' ' ^ ■ . , • . ^ ■ 

and Basljaw (1975, .1977) . Their matrix of linking constants is slightly dif- 

ferent inform from the one analyzed in this paper. They have two linking • 

constants 'for each test, based on separate administrations of the. test s given 

■ ■ ■ » • 
in different time ordbr. (It would be interesting to extend the model g'iven 

in this paper:;to include a test of the signif i^cance of this time order effect.) 



. • • .: . . • ■ ■■ . ■ \ ■ 

'The computation of* row means and the use of.' these means as average test difficul- 

.ties that 6bxv be used as linking constants Is equival-^nt to the values obtained 
by the method propos^^t^n this ^aper» This paper adds to their approach by 
aakihg explicit the general lii^ear model implied by the method used by Rentz, 
Bashaw and Wright. By making this qodel explicit, it is possible to obtain 

4^tests of the fit of each linking constant within the overall test network, and 
also'to bbtain least squares estimated for the' unavailable links in incomplete 

. test. network3. \ ^ - ' 

\ • ' ' ^ ■ ' r " ■ ' ' \. - ■ ■ 

I This paper presents a ^ethod that extends the criteri&n of consistency 

Ijroposed by Wright (1977) for examining these networks. Previous research on 

the; adequacy of the Rasch ftodel for vertical equating has either compared 

reisultis, from different equating methods, or divided the people/ into different 

ability /groups ai(d compared the'' results of the separate calibrations in each 

group. ' The problem with both of /^ese . approaches ij that there is, no single 

objective criterion or any "best** method of equating that can be used as a, 

9t^<|ard criterion, for compard^ng ec^uating methods. 1 have suggested that the 

criterion for assessing the adequacy of a vertical equating using the Rasch 

mq^el\ be, based on a consideration /of the following conditions. The first con- 

ditidn for an acceptable equating /is that the items within each test fit the 

Rasch m*odel; C5ee Wright and Stoiie [1979] for tests of item. fit.) If this is 

true, then a single linking constant -based on common items can be used to ^ 




vertic|H^ Equate the tests. The second condition is that the 'commoti items 



used to compute the linking (^bnstant must be linearly related. *A plot of the 
difficulties , for these common items obtained from the separate tests to be 
equated can then be represented by a straight line with a slo^e of jone. (See 
Engelhard Cl980j for an exafnple of this analysis of the fit of items to the link.) 



9 



16 



The last condition is that the criteria of coherence and cohsi^ency of the link- 

.1 * ^ . - 

ing. constants within the test network must be met for an acceptable equating. 
. ^If these three conditions are met, then an acceptable equating of the tests in 
^ the network has been realized. In the present example, the three conditions 
for acceptable equating were met. 
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Table 1 . Matrix of observed linking constants . 

1 2 3 4 .5 6 7 8 



10 11 li 13 14 15 16 17 18 19 20 



7 

0 1.013 



7" 



0 .015 1.608 .773 - 2.509 - 

0 1.301 - - - - 

0 1.084 2.441 .863 ' 

0 - 1.684 



(Scune as values above diagonal « 
except th'e values are nega'tive] 



0 .567 -. - - - - 
0 .980 - - 2.184 - 

0 .246 1.51 1.323 - ^ 

0 - - - 
0 .172 - 

.0 .844 
0 . 



/ 



0Q3 .614 

0 

0 



1.601 - 
.844 - 

.51(0 - 
0" .483 

, 0 



- \ 



- 1.104- - 
.233 .558 .482 - 
0 / - .287 - 

0 • .042 - 

0 .865 



Table 2. Observed links, number of items, number of individuals and standard 
errors of observed links 



Index Cell Observed Number of Number of Standard 

Subscripts ' Links Items Individuals error 



1 


1>2 


,1.013 




8 


322 ^ 


.069 


2 


2,3 


.015 




12 


322 


.056 


3 


2,4 \ 


1.608 




19 


. 285 


.048 


4 


■ 2,5 


.773 




12 


322 


.056 


5 


2,7 


2'. 509 




8" 


322 


~ .069 


6 


3,4 


1.310 




10' 


285 


.065 


7 


• ' 4,5 


1.084 


* 


12 


285 


.060 


8 


4,6 


2.441 




11 


285 


- .062 


9 


4,7 


.863 




20 


285 


.046 


10 . 


5,7 


1.684 


-9 


333 


.064 


11 


6,7 


.567 




8 


333. 


.068 


12 


7>8 


.980 




23 


312 


.040 


13 


. 7,11 


• 2.184 

L 




11 


328 


.058 


14 


8,9 


.246 




- 11" 


312 


.060- 


15 


8,10 


1.510 




12 


312 


.057 


16 


8,11 


1.323 




24 


312 


.040 


17 


10,11 


.172 




12 


328 


.OSB 


18 


il,12 


.844 




24 


294 


.042 
.058 


19 


. 11,15 


1.601 






.307 


20 


12,13 


..003 




8 ■ 


• 294 


.072 


21 


12,14 


' .614 ■ 




11 


294 


.061 


22 


12,15 


.844 




24 


294 


.042 


23 


14,15 


.510 




12 


307 


.058 


24 


15,16 


.483 




23 


307 


.042 


25 


15,19 


1.104 




, 11 


307 


.060 


26 


16,17. 


.233 




■71 ■ 12 


597 


.041 


27 


16 ,-18 


.558 




' 12 


597 


.041 


28 


16, i9 


.482 




36 V 


597 


.024 


29 


17,19 


.287 




10 


1,204 


.032 


30 


18,19 


.042 




12 


1,204 


.029 


31 


19,20 


.865 




12 


1,204 


.0.29 



fabl« 3. BAsldual analysis of linking cpnstants. 



Index 


Cell 
Subscx'iDta 


Qbsexved 
Links 


Predicted 
Links 


' 4f 

Rfisidual 


/ 

/ 

Residual Rankit 
(Std) 


1 


1-2 




X • UXJ 


• 000 


-.230 


-.202 


2 


* r 




« 1 DP 
• X^iO 


• X4 J 


.227 


.643' 




46 r 


X • QUO 


X .u^u 




1.588 


1.175 


4 


2.5 


771 


1 im J 




• -1.920 


-1.645 


5 


2.7 


2 ^nQ 


2 fiQ2 


• XO J 


-.814 


1.341 


6 


3.4 




1 1 67 

X . XQ # 


• XHw 


.227 


.643 


7 


4.5 


X • \JO*B 




ft "5 "5 


2.400 


2.054 


8 


4.6 


2 AAl 


1 7A1 


A7a 


1.939 


1.476 


9 






x*o>^ 


^ 7PQ 


-2.754 


-2.054 




5.7 


1 6nd 

X • uO*a 


X • ^ 7U 


OOA 


.710 


1.036 


11 


6r7 


• / 


— Ill 


<;7Q 


1.939 


1.476 


12 


7r8 . 


Qfln ■ 


RQn 

• C3 7U 


• U7U 


, .057 


.332 


13 


7.11 

» y AA 


^ • xa*B 


5 01 A 




-.516 


-1.175 


14 


8.9 


24fi 


OA^ 


• IJwU 


-.230 


"..202 


15 


8rl0 


X • 3XVJ 


X • OQU 


• XDU 


.248 


.842 


16 


8rll 


X • 


X • 000 


^ o6n 


-.4?2 


-.915 


17 


lOrll 


172 


099 




.248 


.842 


18 


llrl2 


844 


776 




-.013 


.253 


19 


llrlS 


1 601 


1 66Q 


. n6n \ 


. -.446 


-lJ.036 


20 


12,13 




001 
• Www 


000 


-.230 


-.202 


21 * 


12.14 


' 614 


4Q8* 


116 

• XXQ 


.141 


.468 


22 


12 r 15 


B44 




04R 


-.384 


-.706 


23 


14rl5 


510 




1 16 

• XXw 


.141 


.468 


24 


15rl6 


481 


<^91 




-.358 


-.583 


25 


15rl9 


1 1 o4 


X • U04 


r»4n 


-.101 


.151 


26 


16rl7 


.233 


.245 


-.012 


-.267 


-.468 


27 


16rl8 


.558 


.528 


- .030 


-.134 


.050 


28 




• *mO ^ 






-.416 


-.806 


29 


17,19 


.287 


.296 


-.012 


-.267 


-.468 


30 ' 


I8rl9 


.042 


.012 


.030 


-.134 


.050 


31 


19r20 


.865 


.865 


.000 


-.230 


-.202 


Mean 

Standard Deviation 


.927 
.682 


.856 
.703 


.072 
.313 


.000 
1.000 


.001 
.966 



\ 



Tablm 4. Test difficulties, re-centered links, and comparison with preliminary 
results of an earlier linking study. 



Test Cell Re--centered Test^ Preliminary^ Difference 

Difficulties Subscripts Links ' ID Links 

(delta) 



9.575 


6,1 


-3.815 


4-7 


-4.272 


.457 


8.562 


6,2 


-2.802 








8.690 


6,3 


-2.930 


5-7 


-3.625 


.695 


7.522 


6,4 


-1.763 








7.261 


. 6,5 


-1.510 


6-7 


-2.344 


.834 


5.760 


6,6 


0.000 


7-7 


0.000 


.000 


5.871 


6,7. 


* -.111 








4.980 


6,8 










4.734 


6,9 ^ 


1 1.026 


7-8 


.616 


.410 


:?.620 


6,10 A 


i 2.140 


7-9 . ' 


1.696 


.444 


3I597 


6,11 W 


mk- 2.162 ' 








2.821 


6,12 ^ 


rm- 2.939 








2.818 


6,13 ^ 


1 2.942 


.7-10 


2.235 


^07 


2.323 


6,14 


3.437 


7-11 


2.635 


.'©02 


1.929 


6,15 


3.831 








1.405 


6 ,16 


4.354 








1.161 


6,17 


4.599 


7-12 


3.375 


. 1".224 


.877 


6,18 , 


4.883 


7-13 


4.177 


.706 


.865 • 


6,19 


4.8^4 








-000 


6,20 


5.760 


7-14 


5.094 


.696 


Preliminary links were 6nly obtained for t^e 


Achievement 


Series and 


not . 


for the 


specially created 


linking tests. 


• 







a. The test numbers used through out this paper were assigned in order. to simplify 
the discussion in this paper. The numbers in this column show the actual form 
and level numbers used by Scott, Foresman and Company for their Achievement 
Series. 

. b. The values given in this column were based on a preliminaury analysis of the 
Huron County Data used in this paper. The values currently used by Scott, 
Foresman and Company are based on a national representative sample. This., 
data is not currently available. 
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ffteurt littwoylc of Bsadlng T«ata 



K 




B 



10 



111 



12 



14' 



17 



.10 



,?.0 



Grade 



5* 



7-8 



9-10 



11-12 
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riguv« a. Mnkit plot: of atiindardlaad e«aldu«la. 

^ I.I I I , , in I , ■ 



•Standardized Residuals 




